The pure parsimony haplotyping problem: overview and computational advances
نویسندگان
چکیده
Haplotyping estimation from aligned Single Nucleotide Polymorphism (SNP) fragments has attracted more and more attention in the recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping of complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. The literature proposes a number of estimation criteria to select a set of haplotypes among possibles alternatives. Usually, such criteria can be expressed under the form of objective functions, and the sets of haplotypes that optimize them are referred to as optimal. One of the most important estimation criteria is the pure parsimony which states that the optimal set of haplotypes for a given set of genotypes is the one having minimal cardinality. Finding the minimal number of haplotypes necessary to explain a given set of genotypes involves solving an optimization problem, called the Pure Parsimony Haplotyping (PPH) estimation problem, which is notoriously NP-Hard. This article provides an overview of PPH, and discusses the different approaches to solution that occur in the literature.
منابع مشابه
1 Haplotype Inference
Fresh Pond Research Institute 1.1 Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-1 1.2 Introduction to Variation, SNPs, Genotypes, and Haplotypes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2 The Biological Problem • The Computational Problems • The Need for a Genetic Model • Two Major Ap...
متن کاملApproximation algorithms for the minimum rainbow subgraph problem
Our research was motivated by the pure parsimony haplotyping problem: Given a set G of genotypes, the haplotyping problem consists in finding a set H of haplotypes that explains G. In the pure parsimony haplotyping problem (PPH) we are interested in finding a set H of smallest possible cardinality. The pure parsimony haplotyping problem can be described as a graph colouring problem as follows: ...
متن کاملComputational Experiments on Algorithms for Haplotype Inference Problems by Pure Parsimony
To analyze the function of DNA, researchers have to obtain each haplotype, the genetic constitution of an individual chromosome, of an individual for analysis. Due to the significant efforts required in collecting haplotypes, the descriptions of one conflated pair of haplotypes called genotypes are usually collected. Since the genotype data contains insufficient information to identify the comb...
متن کاملA Class Representative Model for Pure Parsimony Haplotyping
Haplotyping estimation from aligned Single Nucleotide Polymorphism (SNP) fragments has attracted more and more attention in the recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping of complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. The literatu...
متن کاملA Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
The Pure Parsimony Haplotyping (PPH) problem is a NP-hard combinatorial optimization problem that consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. PPH has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping complex disease genes to inferri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- ITOR
دوره 16 شماره
صفحات -
تاریخ انتشار 2009